# Native Pretraining
Internvl3 8B Instruct GGUF
Apache-2.0
InternVL3-8B-Instruct is an advanced multimodal large language model (MLLM) that demonstrates exceptional overall performance, with strong multimodal perception and reasoning capabilities.
Text-to-Image
Transformers

I
unsloth
2,412
1
Internvl3 14B Instruct GGUF
Apache-2.0
InternVL3-14B-Instruct is an advanced Multimodal Large Language Model (MLLM) that demonstrates exceptional multimodal perception and reasoning capabilities, supporting various tasks such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.
Image-to-Text
Transformers

I
unsloth
982
1
Internvl3 78B Pretrained
Other
InternVL3-78B is an advanced multimodal large language model developed by OpenGVLab, demonstrating exceptional comprehensive performance. Compared to its predecessor InternVL 2.5, it possesses stronger multimodal perception and reasoning capabilities, extending its abilities to new domains such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.
Text-to-Image
Transformers Other

I
OpenGVLab
22
1
Featured Recommended AI Models